Robust Speech Feature Extraction Using the Hilbert Transform Spectrum Estimation Method

نویسندگان

  • Huan Zhao
  • He Liu
  • Kai Zhao
  • Yong Yang
چکیده

The performance of traditional mel-frequency cepstral coefficients (MFCC) speech feature extraction method decreases drastically in the complex noisy environment. To improve the performance and robustness of speech recognition system, which is based on spectral envelope estimation method, the minimum distortionless response spectrum MVDR-MFCC (Minimum Variance Distortionless Response-MFCC) feature extraction method was proposed. However, the computational complexity of MVDR-MFCC is very high. In this paper, we proposed MHCC (Hilbert-MFCC) feature extraction method for speech, which introduced the Hilbert transform to MFCC process. The experiments, under 8 different noisy environments, indicate that, compared with MVDR-MFCC feature extraction method, the proposed method not only reduces the algorithm’s complexity significantly, but also is less affected by noises, achieving significant improvement in the robustness—the average recognition rate across different noise types and SNRs increases by 12%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Regularized MVDR spectrum estimation-based robust feature extractors for speech recognition

In this paper, we present two robust feature extractors that use a regularized minimum variance distortionless response (RMVDR) spectrum estimator instead of the discrete Fourier transform-based direct spectrum estimator, used in many front-ends including the conventional MFCC, for estimating the speech power spectrum. Direct spectrum estimators, e.g., single tapered periodogram, have high vari...

متن کامل

Robust Feature Vector Set Using Higher Order Autocorrelation Coefficients

In this paper, a feature extraction method that is robust to additive background noise is proposed for automatic speech recognition. Since the background noise corrupts the autocorrelation coefficients of the speech signal mostly at the lower orders, while the higher-order autocorrelation coefficients are least affected, this method discards the lower order autocorrelation coefficients and uses...

متن کامل

Improved MFCC Feature Extraction Combining Symmetric ICA Algorithm for Robust Speech Recognition

Independent component analysis (ICA), instead of the traditional discrete cosine transform (DCT), is often used to project log Mel spectrum in robust speech feature extraction. The paper proposed using symmetric orthogonalization in ICA for projecting log Mel spectrum into a new feature space as a substitute in extracting speech features to solve the problem of cumulative error and unequal weig...

متن کامل

MVDR based feature extraction for robust speech recognition

This paper describes a robust feature extraction method for continuous speech recognition. Central to the method is the Minimum Variance Distortionless Response (MVDR) method of spectrum estimation and a feature trajectory smoothing technique for reducing the variance in the feature vectors. The above method, when evaluated on continuous speech recognition tasks in a stationary and moving car, ...

متن کامل

Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition

In this paper, a feature extraction method that is robust to additive background noise is proposed for automatic speech recognition. Since the background noise corrupts the autocorrelation coefficients of the speech signal mostly at the lowertime lags, while the higher-lag autocorrelation coefficients are least affected, this method discards the lower-lag autocorrelation coefficients and uses o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011